AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Meeting transcription optimization

# Meeting transcription optimization

Diar Sortformer 4spk V1
An end-to-end speaker diarization model based on the Sortformer architecture, which resolves permutation issues in diarization by ordering speech segments according to speaker arrival time, supporting recognition of up to 4 speakers.
Audio Processing
D
nvidia
385.49k
36
Segmentation 3.0
MIT
This is a speaker segmentation model based on pyannote.audio, capable of detecting speech activity, speaker changes, and overlapping speech.
Audio Processing
S
tensorlake
387
1
Pyannote Speaker Diarization 31
MIT
Pyannote.audio's speaker diarization pipeline for automatic detection and segmentation of different speakers in audio
Audio Processing
P
collinbarnwell
835
3
Segmentation 3.0
MIT
This is a powerset-encoded speaker diarization model capable of processing 10-second audio clips to identify multiple speakers and their overlapping speech.
Speaker Analysis
S
pyannote
12.6M
445
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase